MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
PROJECT MAC 



ARTIFICIAL INTELLIGENCE PROJECT *a-.a«-V»7 

AI Memo 97A " 



October 13, 1966 



SYMBOLIC INTEGRATION - II 
by Joel Moses 

In *:nis memo we describe the current state of the Inte- 

Familiarity with Memo 97 i 3 « aS umed. Some of the algorithms 
described in that memo have bu*n extended. Certain new algorithm 
and a si^le integration by.p„m routine have been added? The 

bv^A^inU"? "?• inCefiraCe aU !:h « P" b l«» which were solved 
dL to L fddiM tn % Cwo .?"*>?«« attempted by U and not solve. . 
of i„' - ! ddltlon 0f a <^i,ior. procedure y, B prcgram is caparie 

°n^, \ ", g ?-"!'" lnt **" nd » (««ch as e* of e'/x) ••"not 
xntegrabli in closed form. 

Methods of solution Co certain types of differential equal ons 
routine " J "*"' ^ "" ^ lo "«™^on program as Hut 

nu ^iif integration prolan has been dubbed SIN t'for Symbolic 
INtegr,tion.) Below vo preset the methods of integration used 

Hire R(z) and S(z) are rstio-ial functions (ratio of tvo o-lv 
noai.l,, inz sie»( a ) is an elementary function c£ s tfe* S^ 
page 6, for definition cf elementary function.) "*s>*. 



I. x & Ele»(x '), where c, V^ are integers, and wh&rt 
k ■ g.c.d. of the k ( - divides (c + 1). 

Substitute y * x.* 

l ,x 7 /(x A + l)dx becomes 1/4 j y/ (y* 4 I)dy 

('l/x t£*~-~~~J dx , beccsnes 1/2 ( ty - i dy 



II, (Rational) R(x). Chains to a program written by Manave 

which integrates rational functions by a method described in 
Hardy $ 

III. Elem(b3se , base, , ...), where base', a-, b ( ' , are 

constants. 



a. base; is converted to base. * f -«•«»* 

x 

b. y ■ base 1 is substituted 

ef{2 + 36* )dx becomes U/(2 + 3y A )dy 

r 



r p %r 

|10 e^dx becomes \ y f $* dy 



IV. (Chebyschev) Ax (c + c x* )• AjT,^ ^,0. t c^ are constants. 
Convert A to A/q, let r ■ r -f 1 — 1 

q 

a. r^ integer, i^ > 
Substitute y » c~"+~ e A x> 

b. rj integer, r. rational number with denominator d. 

ajj 
Substitute y * x*^- 

c. r, integer, r, < 0, r 4 rational with denominator d, 

~ A ; in * 

Substitute y » (c A + c^x 4 )'^ 

d. r^ + r a is an integer . . 
Substitute y «= /c^ + c^x^ *> /^j. 






V. F.lea (x, ->^ ax + b , -V e x 4 b , ...) vner« me n, ate rcti._T.al nur/Deis . 
»':x +o • ex •*- rt K,n,c.. d are' LW,i cants with zd-^c^. 

I<et k & least cor.xaon ttu ^LipH-. .;f. tae n ( * 

Substitute, y - -VjRTT 
V ex -f o 



Icoc va dx beccties 2y cos.v.dv 
(x ",oT+ 1 dx be " ones j2(> - l)y J 



v 



VI. R(x f 1a + bx 



a. a > 0, b > Substitute can(y) = Ya/b x 
jl/(4 + x^dx becomes jl/2 sec(y)dy 

b. a > 0, b < C Substitute sin(y) - Va/b x 
' t x/(l - x ) A dx becomes Jsin(y)dy 

c. a < 0, b> Substitute sec (y) *-*V-a/b x 



VII. R(x, Tcx^ + hx + a) 

a. c > Substitute y = *VT x +7cx* ? + bx + a 

b. a £ Substitute y -A'cx" 1 + bx + a - Va)/x 

/■ 
VIII. R(x) F(S(x)), where ' R(x}dx 13 rational 

a- F = log . _ 

Solution is log(S(x)) JR(x)dx - jR(x;dx I S (>:)/S (x)rix 

x log(x)dx becomes 1/2 x* log(x) — x/2 dx 



b. F c arctan 
Solution i 

c . F = arcs in 



F = arctan r ,.-. n ^ 

Solution is arctan (S(x)) R(x)dx - I R(x)dxlS (x)/(l + S (x))dx 

F = arcs in r /-__ , 

Solution is arcsin(S(x)) ,R(x)dx - !j R(x)dx S(x)/(1 - S' 3 (x)) J dx 

/.« ^ ^ r "* ' i ^' 

'x arcsin(x)dx becomes 1/3 x arcsin(x) - ' 1/3 x v /(l - x ) J dx 



IX. R(x) Elem(log(a + bx)) 

Substitute y ■ log (a + bx) 



!log(x)/(l + log(x)) dx becomes ■c*' y/(l + yf dy 



X. Klem{F (a + bx)) where ths F; are cr is- or, owe : r It; 'unctions. 

t i 

I. Preparatory step y « a ■*■ b>- yielding 1 /b Elem(F-(y)) 

Note: Some cede has been written for the case in wnich 
the arguments of the Li igonomt trie functions arc linear 
but not identical. 

II. fsin**(y)JOS ft (y)Cy 



a. m < n; transform into ] (1/2 sin 2y) (1/2 + 1/2 ccz 2y) dy 



/* 



./; 



b. m 2 n; transform into ']{!/?. sin Jly) (1/2 - 1/2 cos 2v)~"tly 
Icos (x)dx becomes ;(l/2 + 1/2 cos 2y)dy 

III. i) Trans form all trigonometric functions into sine- 3 and 
cosines and test to see if the result is of the form 

a. sin (y)Elem(sin (y),cos(y)) 

Substitute 2 - cos (y) 

i sin (y)cos(y)dy becomes . (z - z^ )dz 

b. cos* (y) Elem(cos (y), sin(y))dy, substitute Z=sin(y) 

ii) Transform all trigonometric functions into secants anJ 

tangents and determine if the resulting expression is of 
the form 

a. Elem(tan(y), sec* (>-}), substitute z ■ tan(y) 

(sec*(y)/(l + sec*(y) - 3 tan(y))dy 

r i 
becomes J!/(z*- 3z + 2)dz 

b. tan (y) Elem(sec(y), tan~(y))dy, 
substitute z * sec (y) 

iii) Substitute z * tan 1/2 y 

XI. R(x)e where P is a polynomial function of x. See below for a 
description. 



XII. a. In a sum each of the summands is integrated separately. 

b. If the integrand is a small, positive integer power of a :um, 
the integrand is expanded and the result is integrated. 

XIII. (Derivative Divides) Sec below for description. 

XIV. (Integration by Parts) See below for description. 



Procedure 



egrating R(x)e - Algorithm X 



In the previpua memo we mentioned a derision u^oeednrn r«r- <„- ^ 

o< t.u« <v,~ c /,u'!<i • - . , ei * iiU -' p.ocecure ior integrands 

of the fcias R(x)e vnere R is rations! and p is a t'non-ronstar.t) -,oV 
nomlal, This is an extension of the decisis procedure found in Ri'-t^'cr 
th* case l(x) - x. Ths algorithm proceeds by making guesses at the ^o^ton 
This is the way the Edge heuristic would have proceeded. We ha-e no^if-r^* 
the heuristic slightly co arrive at this procedure. 

Let R(x) * gx '+ S,(x) , where ^ Q are polynomials 

Q(x) s 1 is a polynomial of degree < m 

C^ is a constant 

Plx , We know from Ritt that the integral (if any) will be a ir.ult<pl- of 
e . Suppose the integral is represented by (a/x) + h } (x))e Pl fthctx ~ 
p'te)^ + <£ + ? / (x)b 1 + b^ R( X ) - ac%_Sj 





<i 


if we gaess a t (x) » 


' P< x m ' then 


P'O 


- c, X^P 



and P bj + bj 



R(*) - P a A - a i 



A 



S i" m » c > * + gj x fr + Cvx Q 

p P >A ?'(} _ 



Now, consider the numerator of p'b,* b / as a polynomial T, (x) 

^ r a ."f U T a .- w 0n reTn3inder BlW. Let the leading tern of i(x) 

be c A x Contmvs the process indicated above until some T7 <T r , sav) 

* f at off/r lme the corresponding U; (i.e. u,> is also 0, 'then ^he 

expression R(x^'is integrable and the integral is the sum of hea (x 
multiplied ay e* If U, is not 0, then the expression is not intcrible 



Note that I: U„ * 0. then R(x) - p'< a<- - £«. 



0. 



;*j. 



i-i 



Let a « ^_a' , tien we claim that U^ - implies that 
i(x)e' U) dx - a(>*,e W , or 

aP' e** + a' e^ « R*:^ 

(R - aP'- a')J>° 

but e^ 0, henc- R - a p'- a' « 0, but this is assured. 



Examo le s 



r x 
« e 



~ — . dK = __L- * l e d * is not intestable 
(x4 " ] -^ x+J- -Vdx is not intejzrable 



Derivative Divides - Algorithm XIII 



Let the integrand be f(x;g(x) where £(>:) j. is ecch of the non- 
constant terms in the integrands considered in turn of the whole 
integrand if it is not a product of terns. 

a) If f (x) ■ cons tan f*g(x) , then »f (x)g(x) « f. (x) 

' 2*constant 

b) If f is a unary operator such a3 sin, cos and its 
argument is k(x) and k'(x) *■ cons tan t*g (x) then substitute 
y * k(x) and obtain Jl/constant f(y)dy by looking it up 

in a table. This table contains about ten entries. 

c) If f is a binary operator (i.e., expt or log) then k(x) 
is considered to be the non-constant argument (assuming 
only one argument is non-cons tan c. ) . Proceed as with step 
b above. 

This routine is rather conservative in searching for a subexpression 
whose derivative divides the integrand. Since it is applied to each 
problem, our main considerations were speed and infallibility in making 
substitutions. Considering its limitations, it is a highly successful 
routine. For example, it helped solve forty-five out of the eighty-six 
S/.INT problems with an average time of 0.6 seconds on the 7094. 



Eiamole 



ran r. sec x dx = 1/2 tan x 



i 

|x n dx * l/n-*l x 

(cos (ax + 1 )dx » 1/a sin(ax + b) 



cos x dx « e 



At-i 



Integration by Parts - Method XIV 

At the present time the Edge heuristic discussed in our 
previous msmo has net been written. We have, however, written 
a simple integration by parts routine similar to the one found in 
SAINT. The organization that this routine requires is closer to the 
organization found in SAINT than any other part of the program. 

Consider any partition of the integrand into a product of non- 
constant factors g and h, where H » I hdx can be found without call- 
ing tha integration by parts routine, (This is a modification of the 
SAINT method vhich required that the integral be found with an allott- 
ment of no resources.) Now consider Jg>Hdx. We require that this 
integral be found with the number or calls to the integration by parts 
routine fewer than a constant called Maxparts* This constant is 
calculated to bv. twice the maximum absolute value of an exponent of 
a top level term of the integrand. Thus, xcos (x) may use only two 
calls to integration by parts and x^cos (x) may use four. 

If both integrals can be obtained, then the solution is 
fghdx « gH - (g'Hdx. 

There are some obvious difficulties with the approach taken 
above. First, there is no attempt to devise a reasonable partition 
of the integrai.d. Second, there is no routine which determines 
wrether progress has been made in obtaining a solution. Both of these 
p* oblems must ^e solved in a successful implementation of the Edge 
heuristic. 

Note alsc that the value of Maxparts can be set only at the top 
level call to the integration by parts routine. Otherwise, we would 
find up in an infinite loop. This requires the program to determine 
whether it is at the top level call to this routine. Although it is 
debatable whether we really require such a top-level determination in 
this routine, we predict that complex programs will increasingly make 
use of information regarding the history of the program. For a further 
discussion of this point see Teitelman, 

Example 

Jx cos (x)dx ■ x sin(x) - jsin(x)dx 



I; if fer.nt lal Equation* 



R'snizir.r* for soiv-'.i; :he following classes ri differentia L 
e-JU3V toot hav* b-^er, written. 'fr.-e i-: Terra: inn t.ro£r;;:n is calleJ 
as 2 sub roue inr> in eac! 



< ^ - ~< ~ t* 



a) sparable 

A(x)B(y)i:< -f C<;:)D(y)dy « 

lA/C dx 4- fc/P: dv "0 
• J 

x(y*- i)dx - y(x A - I).vy » 

becomes fji ^ x ~ i v /(y^-' D<iv * C 

b) Linear 

/+ ?(x)y x j(x) * . 

becomes ye - •+- l()c'' w dx * : C 

y * c 

y' + y -i- x = ueccrr.ac ve*T y/.a* <ix « C„ 

•;) Bernoulli 

y' + P(x)y + Q(x}y 1 « 

Substitute v = y~ n yielding the linear equation (case b) 
v' + (l-n)F(x)v -r (l-n)Q(x) « 

c) Exact 

P(x,y)dx + Q(x,y)dy = 

where j2_P a ^ (The matching program SCHATCKEu is 

o y <5 x 

used to determine the equivalence.) 



becomes JPdx ■ + J(Q - ^ ffcdxpdy * C ; 

2xydx + Cx + cos(y))dy -- 
becomes x a y + (cos (y)dy » C & 



e) Homogeneous 

P(x,y)dx + Q(x,y)dy - 

where P and Q are homogeneous functions of the sa:ae degree, 
n, say. Substitute y » vx and factor out x" from the 
equation. Substitution yields variables separable (case a). 
(2ye' - >:)dy -f (2x + y;dx ~ G 
becomes (2vc - ! )xdv 4 ,?(v*e"+ 1 }d\ - G 



f) Liutar cosf i'ir. Lznis 

(rr+bT+ry «..b..,..,,c, or* co^um* 

ar.c s' b - ab'f 
substitute x* « x - b'^^bc'. y* * y . £c > . 6 ' c 

and obtain a homogeneous equation (case e). 

y - 2x + y -T " ° becwn «« (2x* 4 y*)dy* - (4x - y*)dr* * c 

g) Linear differential equations with constant coefficients 
L. Ernst has recently written a program which solves 
certain cases of inhomogeneous differential equations 
with constant coefficients. The method used requires 
Laplace transforms which are obtained by a program 
written by C. Englema.n*. The latter program uses the 
above-mentioned program by Manove for integrating 
rational functions. We hope to use these programs as 
subroutines. 



Extensions to SCHATCHEN 

Two new modes have- be<_n scded to the m:-. ccr.in£ profit art 
SCHATCHEN. These have baeq fouuu u^*rui i (l coma in si tuaticn: -. 

.Consider the expression A(::}B(y>dx 4 C (:.)D (: }dy .{-c.-nc?. .«.n 
the section on iif ferential equations when discus: 3 ins sepciraMt. 
equations (case a). Vnat is desired here is that both A and 3 be 
a product of terms involving x and y respectively. T.t wss not 
previously possible to describe to SCHATCHEN a c imp it matching- 
rule which would yield a subset of die terror in a product (c. ; £c 
that matter » in a sum) . The indicators TIMES** and PLUS"** are 'usee- 
for this purpose. For instance> the expression above can be r:R».ched 
using the following pattern: 

(PLUS (TIMES (TIMES** A FREEY) (TIKES** B FREEX)DX) 
(TIMES (TIMES** C FREEY) (TIMES** D FREEXJDY)} 

where FREEX and FREEY are functions which test expressions for 
independence of x and y respectively. 



REPLACE 



A complementary function to SC&YTCIIEM has ber-n written wV;^h 
performs the function of a right-hand- side cf l Ccwit ruie~ that " 
is, it generates ar, expression by replacing in a form the'vallies 
obtained from a dictionary supplied by SCKATCHLM-. REPLACE it \ 
function of two arguments Diet and S:cpr. Diet "is a Usc^f pairs • 
Expr is an expression to be transformed, if we 1st Diet b^ 
<-(u, .v, )(u A .v A ) ... ( Uj| .v rt », than the nest itraight forward use of 
REPLACE is to substitute v^ for each occurrence of xi{ in P;*or 
REPLACE will also attempt tc simplify the results "obrn^ned'^u-in* 
tne substitution as the new expression is beinp. formed. It avoids 
certain redundant simplifications b-/ simplify nip oniv at the level 
at which it is working and not at lower levels / (These are dssuin-d 
simplified through the recursion process.) REPIACE also uses two" 
special indicators suggestively called Eval* and Quote*. 
REPLACE (DICT(EVAL* A)) is the simplified result of 
(EVALCREPLACE DICT A)ALtST) . REPLACE (DICT (QUOTE* A)) is simo-ly A. 
By using these indicators we can get some of the dynamic capabilities 
of CONVERT which are lacking in most other string transformation 
languages. 



Example 

(REPLACE 

(QUOTE (CA. - 1) (N,l))> 

(QUOTE (PLUS X (TIMES A (EXPT XN))P) « 

(REPLACE 

(QUOTE - «A.1)(B.1){X.Y») 

(QUOTE (TIMES X (EVAL* (FACT (PLUS A 3)))))) » (TIMES 2 Y), 

where FACT is the factorial function. 



1.:' 



"i. r\i.L?.,i: 



.«L -wivi.: !>y:*:vc'i.;r: 



1 . 



2) E'v^'^i C, ^.;-'ai.»v;> . .{., »>1 ur.'rri, - . , riaricnal Factions 

^:,. •■ -.IziLcw f.. t ;;;L-fJ.' , ..".-::s . 'Oi--:- U-- '.;.■ ? tier terser. , 1 c ^ , 

3 ) Hi rely . G . H . , 7\\_» I .-; z v; f , r c t i c r:t ci 5u;:ct i c r-. c f «; Sir, ? le 

y«jjj-±i^» - l)C * 2rf«> Cii;;j£idi;e U . I r tis 3~avab7I7gL , inp land , 
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